Deep Reinforcement Learning for Crowdshipping Last-Mile Delivery with Endogenous Uncertainty
نویسندگان
چکیده
In this work, we study a flexible compensation scheme for last-mile delivery where company outsources part of the activity delivering products to its customers occasional drivers (ODs), under named crowdshipping. All deliveries are completed at minimum total cost incurred with their vehicles and plus paid ODs. The decides on best offer ODs planning stage. We model our problem based stochastic dynamic environment orders volunteering make present themselves randomly within fixed time windows. uncertainty is endogenous in sense that influences availability. develop deep reinforcement learning (DRL) algorithm can deal large instances while focusing quality solution: combine combinatorial structure action space neural network approximated value function, involving techniques from machine integer optimization. results show effectiveness DRL approach by examining out-of-sample performance it suitable process samples uncertain data, which induces better solutions.
منابع مشابه
Stochastic last-mile delivery problems with time constraints
When a package is shipped, the customer often requires the delivery to be made within a particular time window or by a deadline. However, meeting such time requirements is difficult, and delivery companies may not always know ahead of time which customers will need a delivery. In this thesis, we present models and solution approaches for two stochastic last-mile delivery problems in which custo...
متن کاملTransshipment Networks for Last-Mile Delivery in Congested Urban Areas1
This paper introduces the concept of transshipment networks, a collection of strategically located transshipment platforms, for efficient and flexible last-mile delivery in congested urban areas. By implementing transshipment platforms, logistics operators can select the locations, light-freight vehicle types and operating schedules that best fit specific distribution strategies, and, simultane...
متن کاملMinimizing waiting time in last mile delivery for smart city
Customers consider a reliable and on-time last mile delivery in Omni-channel business as important as price and good’s quality. We focus on minimizing the waiting time for customers and carriers in last mile delivery. We model the problem as an optimization problem and develop Genetic Algorithm to solve it.
متن کاملNaive Reinforcement Learning with Endogenous
This article considers a simple model of reinforcement learning. All behavior change derives from the reinforcing or deterring effect of instantaneous payoff experiences. Payoff experiences are reinforcing or deterring depending on whether the payoff exceeds an aspiration level or falls short of it. Over time, the aspiration level is adjusted toward the actually experienced payoffs. This articl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Mathematics
سال: 2022
ISSN: ['2227-7390']
DOI: https://doi.org/10.3390/math10203902